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Abstract 


Studies of the spin and parity quantum numbers of the Higgs boson in the WW* —> ev/uv final 
state are presented, based on proton-proton collision data collected by the ATLAS detector at 
the Large Hadron Collider, corresponding to an integrated luminosity of 20.3 fb -1 at a centre - 
of-mass energy of Vs = 8 TeV. The Standard Model spin-parity J CP = 0 ++ hypothesis is 
compared with alternative hypotheses for both spin and CP. The case where the observed 
resonance is a mixture of the Standard-Model-like Higgs boson and CP-even ( J cp = 0 ++ ) or 
CP-odd ( J cp = 0 +_ ) Higgs boson in scenarios beyond the Standard Model is also studied. 
The data are found to be consistent with the Standard Model prediction and limits are placed 
on alternative spin and CP hypotheses, including CP mixing in different scenarios. 
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1 Introduction 


This paper presents studies of the spin and parity quantum numbers of the newly discovered Higgs 
particle [1, 2] in the WW* —> ev/iv final state, where only final states with opposite-charge, different- 
flavour leptons {e,n) are considered. Determining the spin of the newly discovered resonance and its 
properties under charge-parity (CP) conjugation is of primary importance to firmly establish its nature, 
and in particular whether it is the Standard Model (SM) Higgs boson or not. Compared to the previous 
ATLAS publication [3], this paper contains significant updates and improvements: the SM Higgs-boson 
hypothesis is compared with improved spin-2 scenarios. The case where the observed resonance 1 has 
J p = 1 + or 1“ is not studied in this paper as it is already excluded by previous publications both by the 
ATLAS [3] and CMS collaborations [4]. 

To simulate the alternative Higgs-boson hypotheses, the MADGRAPH5_aMC@NLO [5] generator is adop¬ 
ted. It includes terms of higher order (or|) in the Lagrangian, in contrast to the JHU [6, 7] event generator 
used in the previous publication [3]. In the context of this study, the 1-jet final state, which is more 
sensitive to contributions from the higher-order terms, is analysed, in addition to the 0-jet final state. 

Furthermore, the parity of the Higgs resonance is studied by testing the compatibility of the data with 
a beyond-the-Standard-Model (BSM) CP-even or CP-odd Higgs boson [8]. Finally, the case where the 
observed resonance is a mixed CP-state, namely a mixture of a SM Higgs boson and a BSM CP-even or 
CP-odd Higgs boson, is investigated. 

This study follows the recently published H —> WW* analysis [9] in the 0- and 1-jet channels with one 
major difference: the spin and parity analysis uses multivariate techniques to disentangle the various 
signal hypotheses and the backgrounds from each other, namely Boosted Decision Trees (BDT) [10]. 
The reconstruction and identification of physics objects in the event, the simulation and normalisation 
of backgrounds, and the main systematic uncertainties are the same as described in Ref. [9]. This paper 
focuses in detail on the aspects of the spin and parity analysis that differ from that publication. 

The outline of this paper is as follows: Sect. 2 describes the theoretical framework for the spin and parity 
analysis, Sect. 3 discusses the ATLAS detector, the data and Monte Carlo simulation samples used. The 
event selection and the background estimates are described in Sects. 4 and 5, respectively. The BDT 
analysis is presented in Sect. 6, followed by a description of the statistical tools used and of the various 
uncertainties in Sects. 7 and 8, respectively. Finally, the results are presented in Sect. 9. 


2 Theoretical framework for the spin and parity analyses 

In this section, the theoretical framework for the study of the spin and parity of the newly discovered 
resonance is discussed. The effective field theory (EFT) approach is adopted in this paper, within the 
Higgs characterisation model [8] implemented in the MADGRAPH5_aMC@NLO [5] generator. Different 
hypotheses for the Higgs-boson spin and parity are studied. Three main categories can be distinguished: 
the hypothesis that the observed resonance is a spin-2 resonance, a pure CP-even or CP-odd BSM Higgs 
boson, or a mixture of an SM Higgs and CP-even or CP-odd BSM Higgs bosons. The latter case would 
imply CP violation in the Higgs sector. In all cases, only the Higgs boson with a mass of 125 GeV is 
considered. In case of CP mixing, the Higgs boson would be a mass eigenstate, but not a CP eigenstate. 


1 In the following the abbreviated notation J p is used instead of J cp . 
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The approach used by this model relies on an EFT, which by definition is only valid up to a certain energy 
scale A. This Higgs characterisation model considers that the resonance structure recently observed 
corresponds to one new boson with J p = (E, 1* or 2 + and with mass of 125 GeV, assuming that any 
other BSM particle exists at an energy scale larger than A. The EFT approach has the advantage of being 
easily and systematically improvable by adding higher-dimensional operators in the Lagrangian, which 
effectively corresponds to adding higher-order corrections, following the same approach as that used in 
perturbation theory. The A cutoff scale is set to 1 TeV in this paper, to account for the experimental results 
from the LHC and previous collider experiments that show no evidence of new physics at lower energy 
scales. More details can be found in Ref. [8]. In the EFT approach adopted, the Higgs-boson couplings 
to particles other than W bosons are ignored as they would impact the signal yield with no effects on the 
H —> WW* decay kinematics, which is not studied in this analysis. 

This section is organised as follows. Higgs-like resonances in the framework of the Higgs characterisation 
model are introduced in Sects. 2.1.1 and 2.2.1, for spin-2 and spin-0 particles, respectively. The specific 
benchmark models under study are described in Sects. 2.1.2 and 2.2.2. 

2.1 Spin-2 theoretical model and benchmarks 

2.1.1 Spin-2 theoretical model 

Given the large number of possible spin-2 benchmark models, a specific one is chosen, corresponding 
to a graviton-inspired tensor with minimal couplings to the SM particles [11]. In the spin-2 boson rest 
frame, its polarisation states projected onto the parton collision axis can take only the values of ±2 for 
the gluon fusion (ggF) process and ± 1 for the qq production process. For the spin-2 model studied in this 
analysis, only these two production mechanisms are considered. The Lagrangian for a spin-2 minimal 
coupling model is defined as: 



( 1 ) 


p=v,f 


where T pv is the energy-momentum tensor, X pv is the spin-2 particle field and V and / denote vector bo¬ 
sons (Z, W , y and gluons) and fermions (leptons and quarks), respectively. The k p are the couplings of the 
Higgs-like resonance to particles, e.g. K q and K g label the couplings to quarks and gluons, respectively. 

With respect to the previous publication [3], the spin-2 analysis presented in this paper uses the Mad- 
GRAPH5_aMC@NLO [5] generator, which includes higher-order tree-level QCD calculations. As dis¬ 
cussed in the following, these calculations have an important impact on the Higgs-boson transverse mo¬ 
mentum Pj distribution, compared to the studies already performed using a Monte Carlo (MC) generator 
at leading order [6, 7]. In fact, when K q is not equal to K q (non-universal couplings), due to ordcr-o^ terms, 
a tail in the pi j? spectrum appears. 

For leading-order (LO) effects, the qq and ggF production processes are completely independent, but the 
beyond-LO processes contain diagrams with extra partons that give rise to a term proportional to (K q —K g ) 2 , 
which grows with the centre-of-mass energy squared of the hard process (s) as s 3 /(m 4 A 2 ) (where m is 
the mass of the spin-2 particle), and leads to a large tail at high values of pi 3 . The distributions of some 
spin-sensitive observables are affected by this tail. For a more detailed discussion, see Ref. [8]. This 
feature appears in final states with at least one jet, which indeed signals the presence of effects beyond 
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leading order. Therefore, the 1-jet category is analysed in addition to the 0-jet category in this paper, in 
order to increase the sensitivity for these production modes. Figure 1 shows the p !j? distribution for the Gl¬ 
and 1-jet final states at generator level after basic selection requirements (the minimum p-\ required for 
the jets used for this study is 25 GeV). Three different signal hypotheses are shown: one corresponding to 
universal couplings, k cj = K q , and two examples of non-universal couplings. The tail at high values of 
is clearly visible in the 1-jet category for the cases of non-universal couplings. 



Figure 1: The distribution of the transverse momentum of the Higgs boson, pi j!, at the Monte Carlo event-generator 
level for 0-jet (left) and 1-jet (right) final states. Three spin-2 signal hypotheses are shown: k cj - K q = 1, K g = 0.5, 
K q = 1 and K g — 1, K q =0. The last bin in each plot includes the overflow. 


This pj tail would lead to unitarity violation if there were no cutoff scale for the validity of the theory. 
By definition, in the context of the EFT approach, at a certain scale A, new physics should appear and 
correct the unitarity-violating behaviour, even below the scale A. There is a model-dependent theoretical 
uncertainty on the pj scale at which the EFT would be corrected by new physics: this uncertainty dictates 
the need to study benchmarks that use different pj cutoffs, as discussed in the following subsection. 


2.1.2 Choice of spin-2 benchmarks 

Within the spin-2 model described in the previous section, a few benchmarks, corresponding to a range of 
possible scenarios, are studied in this paper. In order to make sensible predictions for the spin-sensitive 
observables in the case of non-universal couplings, a cutoff on the Higgs-boson transverse momentum is 
introduced at a scale where the EFT is assumed to still be valid: this is chosen to be one-third of the scale 
A, corresponding to pj < 300 GeV. On the other hand, the lowest possible value up to which the EFT is 
valid by construction is the mass of the resonance itself; therefore it is important to study the effect of a 
threshold on pj at 125 GeV. 

Five different hypotheses are tested against the data: 

• universal couplings: K g = K q \ 

• K CJ = 1 and K q = 0, with two p^ cutoffs at 125 and 300 GeV; 

• Kg = 0.5 and K q - 1, with two p^ cutoffs at 125 and 300 GeV. 
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The case k cj = 0 and K q = I is not considered here, because it leads to a pi j? distribution which disagrees 
with the data, as shown in the H —> yy and H —> ZZ differential cross-section measurements [12, 13]. 


2.2 Spin-0 and CP-mixing theoretical models and benchmarks 
2.2.1 Spin-0 and CP-mixing theoretical models 

In the case where the spin of the Higgs-like resonance is zero, there are several BSM scenarios that predict 
the parity of the Higgs particle to be either even or odd [14]. Another interesting possibility is that the 
Higgs-like resonance is not a CP eigenstate, but a mixture of CP-even and CP-odd states. This would 
imply CP violation in the Higgs sector, which is possible in the context of the Minimal Supersymmetric 
Standard Model [15] or of two Higgs-doublet models [16]. This CP violation might be large enough to 
explain the prevalence of matter over antimatter in the universe. 

In the adopted EFT description, the scalar boson has the same properties as the SM Higgs boson, and 
its interactions with the SM particles are described by the appropriate operators. The BSM effects are 
expressed in terms of interactions with SM particles via higher-dimensional operators. 

The effective Lagrangian £ q adopted for this study, in order to describe the interactions of W bosons 
with scalar and pseudoscalar states, is expressed as: 

= jccAsM^HWW W+W~ p ] - ^Mhww W+ v W~ pv + S a K A ww w; v w~^ v ] 

- ^c a [( Kwm W + v 8 p W~ pv + h.c.)\}x 0 , (2) 

where W pv - d p W* - d v W ±, W pv = 1 /2 • e j)Vj) , T W pcr and e jlvp(T is the Levi-Civita tensor, while Xq represents 
the spin-0 Higgs-boson field [8]. In the SM, the coupling of the Higgs boson to the W bosons is given 
by feww. while the angle a describes the mixing between CP-even and CP-odd states. The notation 
c a = cos a ,s a = sin a is used in the Lagrangian. The dimensionless coupling parameters /c, are real 
and describe CP violation in the most general way. The parameter k sm describes the deviations of the 
Higgs-boson coupling to the vector boson W from those predicted by the SM, while /c AWW and % ww are 
the BSM CP-odd and CP-even coupling parameters, respectively. 2 The mixing between the CP-even 
SM Higgs boson and the CP-even BSM Higgs boson can be achieved by changing the relative strength 
of the couplings k sm and /c HWW . The cos a term multiplies both the SM and BSM CP-even terms in the 
Lagrangian and therefore its value does not change the relative importance of those contributions. This 
is different from the mixing of CP-even and CP-odd states, as a sin a term multiplies the CP-odd state in 
the Lagrangian. The last term of the Lagrangian is due to derivative operators which are relevant in the 
case one of the two vector bosons is off-shell. 

The higher-dimensional operator terms in the Lagrangian are the terms that contain /c AWW and /e HWW and 
are suppressed by a factor 1 /A. The SM Higgs boson is described by the first term of the Lagrangian, 
corresponding to the following choice of parameters: k sm = 1, /c AWW = a" hww = 0 and \c <r \ - 1. The deriv¬ 
ative operator (the term) described in the Lagrangian of Eq. (2) would modify the results below the 


2 The Lagrangian terms associated to the higher-dimensional operators are called in this paper BSM CP-even and BSM CP-odd 
Higgs bosons. 
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sensitivity achievable with the available data statistics. In fact, the effects on the kinematic distributions 
introduced by the derivative operator in the same range of variation of k hw W are at most 10-20% of the 
ones produced by k hww itself. Since the present analysis is barely sensitive to k hww , the even smaller k iuav 
variations are not studied further, and the corresponding term in the Lagrangian is neglected. 


2.2.2 Choice of CP benchmarks 

The following approach to study different CP hypotheses under the assumption of a spin-0 hypothesis is 
taken in this paper. First of all, in the fixed-hypothesis scenario, the cases where the observed resonance 
is a pure BSM CP-even or CP-odd Higgs boson are considered. In addition, the mixing between the CP- 
even SM and BSM CP-odd or CP-even Higgs bosons is studied. In the CP-odd case, the mixing depends 
on the value of k aww and on the mixing angle a. As can be deduced from Eq. (2), varying a or /e AWW has 
an equivalent effect on the kinematic variable distributions; therefore in this paper only the a parameter 
is varied while k aww is kept constant. The scan range of a covers the entire range from -7r/2 to n/2 as 
the final state kinematic distributions differ for positive and negative values of a. On the other hand, the 
mixing between the CP-even SM and CP-even BSM Higgs bosons depends exclusively on the value of 
k hww and not on the value of a. 

To summarise, four hypotheses are tested against the data in this paper (for the cutoff value A = 1 TeV): 

• Compare the SM Higgs-boson case with the pure BSM CP-even case, defined as k sm = 0, /c AWW = 0, 

%WW = lj C a — 1. 

• Compare the SM Higgs-boson case with the BSM CP-odd case, defined as k sm = 0, k aww = 1, 
%ww = 0 1 c a = 0. 

• Scan over tan a: under the assumption of a mixing between a CP-even SM Higgs boson and a CP- 
odd BSM Higgs boson. The mixing parameter is defined as (k AWW /K SM ) • tan a, where k AWW = (1/4) • 
(v/A) • k a ww, v is the vacuum expectation value and tana is the only variable term (corresponding 
to variations of c a between -1 and 1). The other parameters are set as follows: k sm = 1, k aww = 1, 

%WW = 0. 

• Scan over k hww : under the assumption of a mixing between a CP-even SM Higgs boson and a 
CP-even BSM Higgs boson. The mixing parameter is defined as k H ww/k S m, where k H ww = (1/4) - 
(v/A) ■ /e H ww and the only variable term is k hww (corresponding to variations of k HW w/^sm between 
-2.5 and 2.5). For larger values of this ratio, the kinematic distributions of the final-state particles 
asymptotically tend to the ones obtained in presence of a pure CP-even BSM Higgs boson. The 
latter is used as the last point of the scan. The other parameters are set as follows: k sm = 1, k aww = 0, 
c a — 1. 

In the case of CP-mixing, only one MC sample is generated (see Sect. 3), and all other samples are 
obtained from it by reweighting the events on the basis of the matrix element amplitudes derived from 
Eq. (2). The precision of this procedure is verified to be better than the percent level. The mixing 
parameters used to produce this sample are chosen such that the kinematic phase space for all CP-mixing 
scenarios considered here was fully populated, and the values of the parameters are: k sm = 1, k aww = 2, 
Khww = 2, C a = 0.3. 

In addition, it is interesting to study the case where the SM, the BSM CP-even and the CP-odd Higgs bo¬ 
sons all mix. Unfortunately, in the H —» WW* channel, the present data sample size limits the possibility 
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to constrain such a scenario, which would imply a simultaneous scan of two parameters tan a and r HWW . 
In particular this is due to the lack of sensitivity in the a- hww scan, consequently, as already stated, both the 
two and the three parameter scans, including in addition the derivative operators, are not pursued further. 
These studies are envisaged for the future. 


3 ATLAS detector, data and MC simulation samples 

This section describes the ATLAS detector, along with the data and MC simulation samples used for this 
analysis. 


3.1 The ATLAS detector 

The ATLAS detector [17] is a multipurpose particle detector with approximately forward-backward sym¬ 
metric cylindrical geometry and a near 47 r coverage in solid angle . 3 

The inner tracking detector (ID) consists of a silicon-pixel detector, which is closest to the interaction 
point, a silicon-strip detector surrounding the pixel detector, both covering up to |//| = 2.5, and an outer 
transition-radiation straw-tube tracker (TRT) covering |//| < 2. The ID is surrounded by a thin supercon¬ 
ducting solenoid providing a 2 T axial magnetic field. 

A highly segmented lead/liquid-argon (LAr) sampling electromagnetic calorimeter measures the energy 
and the position of electromagnetic showers over |?/| < 3.2. The LAr calorimeter includes a presampler 
(for \tj\ < 1.8) and three sampling layers, longitudinal in shower depth, up to |//| < 2.5. LAr sampling 
calorimeters are also used to measure hadronic showers in the end-cap (1.5 < I 77 I < 3.2) and both the 
electromagnetic and hadronic showers in the forward (3.1 < I 77 I < 4.9) regions, while an iron/scintillator 
tile sampling calorimeter measures hadronic showers in the central region (|//| < 1.7). 

The muon spectrometer (MS) surrounds the calorimeters and is designed to detect muons in the pseu¬ 
dorapidity range \r/\ < 2.7. The MS consists of one barrel (I 77 I < 1.05) and two end-cap regions. A system 
of three large superconducting air-core toroid magnets provides a magnetic field with a bending integral 
of about 2.5 T-m (6 T-m) in the barrel (end-cap) region. Monitored drift-tube chambers in both the barrel 
and end-cap regions and cathode strip chambers covering 2.0 < \rj\ < 2.7 are used as precision measure¬ 
ment chambers, whereas resistive plate chambers in the barrel and thin gap chambers in the end-caps are 
used as trigger chambers, covering up to |//| = 2.4. 

A three-level trigger system selects events to be recorded for offline analysis. The first-level trigger is 
hardware-based, while the higher-level triggers are software-based. 


3 The experiment uses a right-handed coordinate system with the origin at the nominal pp interaction point at the centre of 
the detector. The positive x-axis is defined by the direction from the origin to the centre of the LHC ring, the positive i/-axis 
points upwards, and the z-axis is along the beam direction. Cylindrical coordinates (r, <p) are used in the plane transverse to 
the beam, with (p the azimuthal angle around the beam axis. Transverse components of vectors are indicated by the subscript 
T. The pseudorapidity is defined in terms of the polar angle 6 as 77 = -In tan(0/2). The angular distance between two objects 
is defined as A R = -y/(A rj) 2 + (A (p) 2 . 
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3.2 Data and Monte Carlo simulation samples 

The data and MC simulation samples used in this analysis are a subset of those used in Ref. [9] with the 
exception of the specific spin/CP signal samples produced for this paper. 

The data were recorded by the ATLAS detector during the 2012 LHC run with proton-proton collisions 
at a centre-of-mass energy of 8 TeV, and correspond to an integrated luminosity of 20.3 fb _1 . This 
analysis uses events selected by triggers that required either a single high-px lepton or two leptons. Data 
quality requirements are applied to reject events recorded when the relevant detector components were 
not operating correctly. 

Dedicated MC samples are generated to evaluate all but the VV'+jcts and multi-jet backgrounds, which are 
estimated using data as discussed in Sect. 5. Most samples use the Powheg [18] generator, which includes 
corrections at next-to-leading order (NLO) in as for the processes of interest. In cases where higher par- 
ton multiplicities are important, Alpgen [19] or Sherpa [20] provide merged calculations at tree level for 
up to five additional partons. In a few cases, only leading-order generators (such as AcerMC [21] or 
gg2VV [22]) are available. Table 1 shows the event generator and production cross-section times branch¬ 
ing fraction used for each of the signal and background processes considered in this analysis. 

The matrix-element-level Monte Carlo calculations are matched to a model of the parton shower, un¬ 
derlying event and hadronisation, using either Pythia6 [23], Pythia8 [24], Herwig [25] (with the under¬ 
lying event modelled by Jimmy [26]), or Sherpa. Input parton distribution functions (PDFs) are taken 
from CT10 [27] for the Powheg and Sherpa samples and CTEQ6L1 [28] for the Alpgen+Herwig and 
AcerMC samples. The Drell-Yan (DY) sample (Z/y*+jets) is reweighted to the MRST PDF set [29]. 

The effects of the underlying event and of additional minimum-bias interactions occurring in the same 
or neighbouring bunch crossings, referred to as pile-up in the following, are modelled with Pythia 8, and 
the ATLAS detector response is simulated [30] using either Geant4 [31] or Geant 4 combined with a 
parametrised GEANT4-based calorimeter simulation [32]. 

For the signal, the ggF production mode for the H —> WW* signal is modelled with Powheg+Pythia8 [33, 
34] at mu = 125 GeV for the SM Higgs-boson signal in the spin-2 analysis, whereas MadGraph5_aMC @ 
NLO [5] is used for the CP analysis. The H + 0,1,2 partons samples are generated with LO accuracy, 
and subsequently showered with Pythia6. For the BSM signal, the MADGRAPH5_aMC@NLO generator 
is used in all cases. For the CP analysis, all samples (SM and BSM) are obtained by using the matrix- 
element reweighting method applied to a CP-mixed sample, as mentioned in Sect. 2.2.1, to provide a 
description of different CP-mixing configurations. The PDF set used is CTEQ6L1. To improve the 
modelling of the SM Higgs-boson px, a reweighting scheme is applied that reproduces the prediction 
of the next-to-next-to-leading-order (NNLO) and next-to-next-to-leading-logarithms (NNLL) dynamic- 
scale calculation given by the HRes2. 1 program [35, 36]. The BSM spin-0 Higgs-boson px is reweighted 
to the same distribution. 

Cross-sections arc calculated for the dominant diboson and top-quark processes as follows: the inclusive 
WW cross-section is calculated to NLO with MCFM [37]; non-resonant gluon fusion is calculated and 
modelled to LO in as with gg2VV, including both WW and ZZ production and then - interference; ft 
production is normalised to the calculation at NNLO in as, with resummation of higher-order terms to 
NNLL accuracy, evaluated with Top++2.0 [38]; single-top-quark processes are normalised to NNLL, 
following the calculations from Refs. [39, 40] and [41] for the ^-channel, f-channel, and Wt processes, 
respectively. 


Table 1: Monte Carlo samples used to model the signal and background processes. The corresponding cross- 
sections times branching fractions, cr ■ B, are quoted at yfs = 8 TeV. The branching fractions include the decays 
t — > Wb, W —> tv, and Z —> tt (except for the process ZZ —> tt vv). Here f refers to e, p, or r. The neutral current 
Z/y* —> tt process is denoted Z or y*, depending on the mass of the produced lepton pair. The parameters K g , K q are 
defined in Sect. 2.1.1, while k sm , *Ww. (t*ww, c a are defined in Sect. 2.2.1. 


Process 

MC generator 

Filter 

cr ■ S (pb) 

Signal samples used in J p = 2 + 
SM H —> VPfP* 

analysis 

Powheg+Pythia8 


0.435 

II 

MADGRAPH5_aMC @ NLO+Pythia6 


- 

o 

II 

II 

MADGRAPH5_aMC @ NLO+Pythia6 


- 

Kg — 0.5, Kg — 1 

MADGRAPH5_aMC @ NLO+Pythia6 


- 

Signal samples used in CP-mixing analysis 




%WW — 2, A"aWW — 2 

MADGRAPH5_aMC @ NLO+Pythia6 



Background samples 

WW 

qq -> WW and qg -> WW 

Powheg+Pythia6 


5.68 

gg —> WW 

gg2VV+Herwig 


0.196 

Top quarks 

ft 

Powheg+Pythia6 


26.6 

Wt 

Powheg+Pythia6 


2.35 

tqb 

AcerMC+Pythia6 


28.4 

tb 

Powheg+Pythia6 


1.82 

Other dibosons (VV) 

Wy 

Alpgen+Herwig 

p\> 8 GeV 

369 

Wy* 

Sherpa 

mu ^ 2 GeV 

12.2 

WZ 

Powheg+Pythia8 

niff > 1 GeV 

12.7 

Zy 

Sherpa 

p\> 8 GeV 

163 

Zy* 

Sherpa 

min. mu < 4 GeV 

7.31 

ZZ 

Powheg+Pythia8 

mu >4 GeV 

0.733 

ZZ —> tt vv 

Powheg+Pythia8 

mu >4 GeV 

0.504 

Drell -Yan 

Z/y* 

Alpgen+Herwig 

mu >10 GeV 

16500 


The WW background and the dominant backgrounds involving top-quark production (tt and Wt) are 
modelled using the Powheg+Pythia6 event generator [42-45]. For WW, WZ, and ZZ production via non- 
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resonant vector boson scattering, the Sherpa generator provides the LO cross-section and is used for event 
modelling. The negligible vector-boson-scattering (VBS) ZZ process is not shown in Table 1 but is in¬ 
cluded in the background modelling for completeness. The process Wy* is defined as associated W+Z/y* 
production, containing an opposite-charge same-flavour lepton pair with invariant mass mu less than 
7 GeV. This process is modelled using Sherpa with up to one additional parton. The range mu > 7 GeV is 
simulated with Powheg+Pythia8 and normalised to the Powheg cross-section. The use of Sherpa for Wy* 
is due to the inability of Powheg+Pythia8 to model invariant masses down to the production threshold. 
The Sherpa sample requires two leptons with pj > 5 GeV and | //1 < 3. The jet multiplicity is corrected 
using a Sherpa sample generated with 0.5 < met < 7 GeV and up to two additional partons, while the total 
cross-section is corrected using the ratio of the MCFM NLO to Sherpa LO calculations in the same re¬ 
stricted mass range. A similar procedure is used to model Zy *, defined as Z/y* pair production with one 
same-flavour opposite-charge lepton pan - having mu < 4 GeV and the other having mu > 4 GeV. 

The Wy and DY processes are modelled using Alpgen+Herwig with merged tree-level calculations of up 
to five jets. The merged samples are normalised to the NLO calculation of MCFM (for Wy) or the NNLO 
calculation of DYNNLO [46] (for DY). The Wy sample is generated with the requirements p y y > 8 GeV 
and A R(y, Y) > 0.25. 

A Sherpa sample is used to accurately model the Z(—» tl)y background. The photon is required to have 
Pj> 8 GeV and A R(y, () >0.1; the lepton pair must satisfy met >10 GeV. The cross-section is normalised 
to NLO using MCFM. Events are removed from the Alpgen+Herwig DY samples if they overlap with 
the kinematics defining the Sherpa Z (—> lt)y sample. 


4 Event selection 

The object reconstruction in terms of leptons, jets, and missing transverse momentum, as well as the 
lepton identification and isolation criteria, which were optimised to minimise the impact of the back¬ 
ground from misidentified isolated prompt leptons, are the same as described in detail in Ref. [9] : these 
aspects are therefore not discussed in this paper. The selection criteria and the analysis methodology used 
for the spin/CP studies described here are different however, since they are motivated not only by the 
need to distinguish the background processes from the Higgs-boson signal, but also by the requirement to 
optimise the separation power between different signal hypotheses. Thus, several selection requirements 
used in Ref. [9] are loosened or removed in the selection described below. 

This section is organised in four parts. First, the event preselection is described, followed by the dis¬ 
cussion of the spin- and parity-sensitive variables. These variables motivate the choice of topological 
selection requirements in the 0-jet and 1-jet categories described in the last two sections. All selection 
criteria are summarised in Table 2 and the corresponding expected and observed event yields are presented 
in Table 3. 


4.1 Event preselection 

The WW —> evpv final state chosen for this analysis consists of ep pairs, namely pairs of opposite-charge, 
different-flavour, identified and isolated prompt leptons. This choice is based on the expected better sens¬ 
itivity of this channel compared to the same-flavour channel, which involves a large potential background 
from Z/y* —> ee/pp processes. The preselection requirements are designed to reduce substantially the 


10 


dominant background processes to the Higgs-boson signal (see Sect. 5) and can be summarised briefly as 
follows: 

• The leading lepton is required to have p-y > 22 GeV to match the trigger requirements. 

• The subleading lepton is required to have pj > 15 GeV. 

• The mass of the lepton pair is required to be above 10 GeV. 

• The missing transverse momentum in the event is required to be p™ ss > 20 GeV. 

• The event must contain at most one jet with pj > 25 GeV and I/ 7 I < 4.5. The jet pj is required to 
be higher than 30 GeV in the forward region, 2.4 < |//| < 4.5, to minimise the impact of pile-up. 

This analysis considers only ep pairs in the 0-jet and 1-jet categories for the reasons explained in Sect. 1. 
Each category is analysed independently since they display rather different background compositions and 
signal-to-background ratios. 


4.2 Spin- and CP-sensitive variables 

The shapes of spin- and CP-sensitive variable distributions are discussed in this section for the preselected 
events. 

Figures 2 and 3 show the variables used to discriminate different spin-2 signal hypotheses from the SM 
Higgs-boson hypothesis for the 0-jet and the 1-jet category, respectively. For both the 0-jet and the 1-jet 
categories, the most sensitive variables arc p^ (transverse momentum of the dilepton system), mu, A (f>u 
{(p angle between the two leptons) and /?7 t (transverse mass of the dilepton and missing momentum sys¬ 
tem). These variables are the same as those used for the spin-2 analysis in the previous publication [3]. 

Similarly, Figs. 4 and 5 show the the variables that best discriminate between an SM Higgs boson and a 
BSM CP-even or CP-odd signal, respectively. The BSM CP-even variables are the same as those used in 
the spin-2 analysis, apart from the p™ ss variable which is substituted for m-y. The variables for the CP-odd 
analysis are mu, Euw, A pj, A (pa, where Euw = p^ - 0.5 p^ + 0.5/;!j. llss , and Pj are respectively 
the transverse momenta of the leading and subleading leptons, and Apx is the absolute value of their 
difference. 

The CP-mixing analysis studies both the positive and negative values of the mixing parameter, as ex¬ 
plained in Sect. 2.2.2. In the BSM CP-even benchmark scan, for negative values of the mixing para¬ 
meter, interference between the SM and BSM CP-even Higgs-boson couplings causes a cancellation that 
drastically changes the shape of the discriminating variable distributions. As an example, Fig. 6 shows 
the distribution of A <pu for the SM Higgs boson together with the distributions for several different values 
of the CP-mixing parameter. 

While for positive values of k hww /k su (Fig. 6 , left) and for the SM Higgs-boson hypothesis, the A (pa dis¬ 
tribution peaks towards low values, when reaching the maximum of the interference (at about k HWW //c SM ~ 
-1), the mean of the A (pee. distribution slowly moves towards higher values. This significantly improves 
the separation power between the SM and the BSM CP-even Higgs-boson hypotheses (Fig. 6 , right). For 
values of k HWW //e SM < - 1 , the peak of distribution gradually moves back to low values of A <pu, as in the 
case of the SM Higgs-boson hypothesis. The sum of the backgrounds is also shown on the same figure. 
The other CP-sensitive variables exhibit a similar behaviour in this specific region of parameter space. 
The impact of this feature on the results is discussed in Sect. 9.3. 
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Figure 2: Expected normalised Higgs-boson distributions of the transverse momentum of the dilepton system p^, 
the dilepton mass ma, the azimuthal angular difference between the leptons and and the transverse mass mj 
for the ep+ 0-jet category. The distributions are shown for the SM signal hypothesis (solid red line) and for three 
spin-2 hypotheses, namely J p = 2 + , K g — 0.5, K q — 1 (dashed yellow line), J p = 2 + , K g - 1, K q — 0 (blue dashed 
line) and J p = 2 + , K g — K q (green dashed line). The expected shapes for the sum of all backgrounds, including the 
data-derived VV’+jets background, is also shown (solid black line). The last bin in each plot includes the overflow. 


4.3 Event selection in the 0-jet and 1-jet categories 

Table 2 summarises the preselection requirements discussed in Sect. 4.1, together with the selections 
applied specifically to the 0-jet and 1 -jet categories. These selection requirements are optimised in terms 
of sensitivity for the different spin and CP hypotheses studied while maintaining the required rejection 
against the dominant backgrounds. In general, they are looser than those described in Ref. [9], which 
were optimised for the SM Higgs boson. 

Some of these looser selection requirements are applied to both the 0-jet and 1-jet categories: 

• The mass of the lepton pair - , mu, must satisfy mu < SO GeV, a selection which strongly reduces 
the dominant WW continuum background. 

• The azimuthal angle, Atpu, between the two leptons, must satisfy Acpu <2.8. 
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Figure 3: Expected normalised Higgs-boson distributions of p r .{, mu, Atpu and m-\ for the ep+ 1-jet category. The 
distributions are shown for the SM signal hypothesis (solid red line) and for three spin-2 hypotheses, namely 
J p = 2 + , K g = 0.5, K q = 1 (dashed yellow line), J p = 2 + , K g = 1, K q = 0 (blue dashed line) and J p = 2 + , K g = 
K q (green dashed line). The expected shapes for the sum of all backgrounds, including the data-derived Vf'+jets 
background, is also shown (solid black line). The last bin in each plot includes the overflow. 


Events in the 0-jet category are required to also satisfy p cp > 20 GeV, while events in the 1-jet category, 
which suffer potentially from a much larger background from top-quark production, must also satisfy the 
following requirements: 

• No ^-tagged jet [47] pj > 20 GeV is present in the event. 

• Using the direction of the missing transverse momentum a r-lepton pair can be reconstructed with 
a mass m TT by applying the collinear approximation [48]; m TT is required to pass the m TT < mz - 
25 GeV requirement to reject Z/y* —» rr events. 

• The transverse mass, m£, chosen to be the largest transverse mass of single leptons defined as 

nij = -yjlpjp'™ ss ( I - cos Ar/>), where A(f> is the angle between the lepton transverse momentum 
and /7| llss , is required to satisfy mi J, > 50 GeV to reject the VT +jcts background. 

• The total ttansverse mass of the dilepton and missing transverse momentum system, mj, is required 
to satisfy «7 t < 150 GeV. 
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Figure 4: Expected normalised Higgs-boson distributions of p\ f, niu. A <pu and the missing transverse momentum 
p™ ss for the eq+0-jet category. The distributions are shown for the SM signal hypothesis (solid red line) and for the 
BSM CP-even signal (dashed line). The expected shapes for the sum of all backgrounds, including the data-derived 
W +jets background, is also shown (solid black line). The last bin in each plot includes the overflow. 


For alternative spin-2 benchmarks with non-universal couplings, as listed in Sect. 2.1.2, an additional 
requirement on the reconstructed Fliggs-boson transverse momentum pi j? is applied in the signal and 
control regions for all MC samples and data. The p !jf variable is reconstructed as the transverse component 
of the vector sum of the four-momenta of both leptons and the missing transverse energy. 

Table 3 shows the number of events for data, expected SM signal and the various background compon¬ 
ents after event selection. The background estimation methods are described in detail in Sect. 5. Good 
agreement is seen between the observed numbers of events in each of the two categories and the sum of 
the total background and the expected signal from an SM Fliggs boson. The 0-jet category is the most 
sensitive one with almost three times larger yields than the 1-jet category. As expected, however, the 
requirements on p affect mostly the 1-jet category, which is sensitive to possible tails at high values of 
Pj, as explained in Sect. 2.1.2. Figures 7 and 8 show the distributions of discriminating variables used 
in the analysis after the full selection for the 0-jet and 1-jet categories, respectively. These figures show 
reasonable agreement between the data and the sum of all expected contributions, including that from the 
SM Fliggs boson. 
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Figure 5: Expected normalised Higgs-boson distributions of mu, the Euw variable defined in Sect. 4.2, the differ¬ 
ence between the transverse momenta of the leading and subleading leptons A pj and A <pu for the e/r+0-jet category. 
The distributions are shown for the SM signal hypothesis (solid red line) and for the BSM CP-odd signal (dashed 
line). The expected shapes for the sum of all backgrounds, including the data-derived VP+jets background, is also 
shown (solid black line). The last bin in each plot includes the overflow. 


5 Backgrounds 


The background contamination in the signal region (SR) is briefly discussed in the previous section. This 
section is dedicated to a more detailed description of backgrounds and their determination. The following 
physics processes relevant for this analysis are discussed: 

• VPVP: non-resonant W-boson pair production; 

• top quarks (labelled as Top): top-quark pah - production (ft) and single-top-quark production (t); 

• misidentified leptons (labelled as VP+jets): VP-boson production, in association with a jet that is 
misidentified as a lepton, and dijet or multi-jet production with two misidentifications; 

• Z/y* decay to rr final states. 
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Figure 6: Expected normalised Higgs-boson distributions of A (f>{[ for the e/i+ 0-jet category. The distributions are 
shown for the SM signal hypothesis (solid red line) and for different mixing hypotheses of the SM Higgs and 
CP-even BSM Higgs bosons, corresponding to positive (left) and negative (right) values of the mixing parameter 
khww/ksm (abbreviated to k in the legend). The expected shapes for the sum of all backgrounds, including the 
data-derived VV'+jets background, is also shown (solid black line). The last bin in each plot includes the overflow. 


Other smaller backgrounds, such as non -WW dibosons (Wy, Wy*, WZ and ZZ) labelled as VV in the 
following, as well as the very small Z/y* —> ee or mi contribution, are estimated directly from simulation 
with the appropriate theoretical input as discussed in Sect. 3. 

The dominant background sources are normalised either using only data, as in the case of the W+jets 
background, or using data yields in an appropriate control region (CR) to normalise the MC predictions, 
as for WW, Z/y* —> rr and top-quark backgrounds. The event selection in control regions is orthogonal 
to the signal region selection but as close as possible to reduce the extrapolation uncertainties from the 
CRs to the SR. The requirements that define these regions are listed in Table 4. 

The control regions, for example the WW CR, are used to determine a normalisation factor, fi, defined 
by the ratio of the observed to expected yields of WW candidates in the CR, where the observed yield is 
obtained by subtracting the non -WW contributions from the data. The estimate for the background 
under consideration, in the SR, can be written as: 

#sr = ^sr Ncr/Bcr = Nqr Bsr/Bcr , (3) 

Normalisation /3 Extrapolation a 


where (Vcr and Z?cr are the observed yield and the MC estimate in the CR, respectively, and B$r is 
the MC estimate in the signal region. The parameter (3 defines the data-to-MC normalisation factor 
in the CR, while the parameter a defines the extrapolation factor from the CR to the SR predicted by 
the MC simulation. With enough events in the CR, the large theoretical uncertainties associated with 
estimating the background only from simulation are replaced by the combination of two significantly 
smaller uncertainties: the statistical uncertainty on Nqr and the systematic uncertainty on a. 

The extrapolation factor a has uncertainties which are common to all MC-sinrulation derived back¬ 
grounds: 
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Figure 7: Expected and observed distributions of p^, mu, A<pu, w?t, p“ lss , A pj and Eu vv for the 0-jet category. 
The shaded band represents the systematic uncertainties described in Sects. 5 and 8. The signal is shown assuming 
an SM Higgs boson with mass mu = 125 GeV. The backgrounds are normalised using control regions defined in 
Sect. 5. The last bin in each plot includes the overflow. 
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Table 2: List of selection requirements in the signal region adopted for both the spin and CP analyses. The pJj? 
selection requirement (*) is applied to all samples when testing the spin-2 benchmarks with non-universal couplings. 


Variable 

Requirements 

Preselection 

^leptons 

Exactly 2 with pj > 10 GeV, ep, opposite sign 

pi 

> 22 GeV 

h 

Pi 

> 15 GeV 

m u 

> 10 GeV 

n miss 

Pj 

> 20 GeV 

0-jet selection 

ff 

Pj 

> 20 GeV 

mu 

< 80 GeV 

A<Pet 

< 2.8 

Pj 

< 125 or 300 GeV (*) 

1-jet selection 

b-ve to 

No ft-jets with pj > 20 GeV 

TYItt 

< m z - 25 GeV 

f 

irij 

> 50 GeV 

mu 

< 80 GeV 

A<f>u 

< 2.8 

mj 

< 150 GeV 

Pt 

< 125 or 300 GeV (*) 


• uncertainty due to higher perturbative orders in QCD not included in the MC simulation, evaluated 
by varying the renormalisation and factorisation scales by factors one-half and two; 

• uncertainty due to the PDF choice, estimated by taking the largest difference between the nominal 
PDF set (e.g. CT10) and two alternative PDF sets (e.g. MSTW2008 [49] and NNPDF2.3 [50]), with 
the uncertainty determined from the error eigenvectors of the nominal PDF set added in quadrature; 

• uncertainty due to modelling of the underlying event, hadronisation and parton shower (UE/PS), 
evaluated by comparing the predictions from the nominal and alternative parton shower models, 
e.g. Pythia and Herwig. 

The section is organised as follows. Section 5.1 describes the WW background - the dominant back¬ 
ground in both the 0- and 1-jet categories. Section 5.2 describes the background from the top-quark 
production, the second largest background in the 1-jet category. The Z/y* —> t + t~ background is de- 
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Table 3: Expected event yields in the signal regions (SR) for the 0- and 1-jet categories (labelled as Oj and lj, 
respectively). For the dominant backgrounds, the expected yields are normalised using the control regions defined in 
Sect. 5. The expected contributions from various processes are listed, namely the ggF SM Higgs-boson production 
(Nggr), and the background contribution from WW ( Nww ), top quark (top-quark pairs Ng, and single-top quark 
N t ), Drell-Yan Z/y* to rr (IVdy.tt), misidentified leptons (Nw+]ets), WZ/ZZ/Wy (Nyy) and Drell-Yan Z/y* to 
ee/pp (IVdy.sf)- The total sum of the backgrounds (IVbkg) is also shown together with the data. Applying the 
Pj requirement in the 0-jet category does not change substantially the event yields, while it has an effect in the 1-jet 
category, as expected. The errors on the ratios of the data over total background, IVbkg, only take into account the 
statistical uncertainties on the observed and expected yields. 



Nggp 

Nww 

N„- 

N t 

-YdY.tt 

^W+jets 

Nyy 

IVdy.sf 

^bkg 

Data 

Data/Vbkg 

OjSR 

218 

2796 

235 

135 

515 

366 

311 

32 

4390 

4730 

1.08 + 0.02 

lj SR: 

77 

555 

267 

103 

228 

123 

131 

5.8 

1413 

1569 

1.11 ± 0.03 

lj SR: < 300 GeV 

77 

553 

267 

103 

228 

123 

131 

5.8 

1411 

1567 

1.11 ± 0.03 

lj SR: < 125 GeV 

76 

530 

259 

101 

224 

121 

128 

5.8 

1367 

1511 

1.11 ± 0.03 


Table 4: List of selection criteria used to define the orthogonal control regions for WW, top-quark and Z/y* —» tt 
backgrounds. 


Control region 

Selection 

WW CR 0-jet 

WW CR-1 jet 

Preselection, p r T c > 20 GeV, 80 < mu <150 GeV 

Preselection, 6-veto, m TT < my - 25 GeV 
ml y > 50 GeV, mu > 80 GeV 

Top CR 0-jet 

Preselection, A <f>u <2.8, all jets inclusive 

Top CR 1 -jet 

At least one 6-jet, m TT < my - 25 GeV 

Z/y* —» rr CR 0-jet 

Preselection, mu < 80 GeV, A<% > 2.8 

Z/y* -» rr CR 1-jet 

Preselection, 6-veto, m ( T > 50 GeV, mu < 80 GeV, | m TT - m-/\ < 25 GeV 


scribed in Sect. 5.3, while the data-derived estimate of the tP+jets background is briefly described in 
Sect. 5.4. The extrapolation factor uncertainties are summarised in Table 5. More details can be found in 
Ref. [9], 


5.1 Non-resonant W-boson pairs 

Non-resonant IV-boson pair production is the dominant (irreducible) background in this analysis. Only 
some of the kinematic properties allow resonant and non-resonant production to be distinguished. The 
WW background is normalised using a control region which differs from the signal region in having a 
different range of dilepton invariant mass, mu- The leptons from non-resonant WW production tend to 
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Figure 8: Expected and observed distributions of p^, mu, M>ff and m '\ for the 1-jet category. The shaded band 
represents the systematic uncertainties described in Sects. 5 and 8. The signal is shown assuming an SM Higgs 
boson with mass nin = 125 GeV. The backgrounds are normalised using control regions defined in Sect. 5. The last 
bin in each plot includes the overflow. 



have a larger opening angle than the resonant WW production. Furthermore, the Higgs-boson mass is 
lower than the mass of the system formed by the two W bosons. Thus, the non-resonant WW background 
is dominant at high met values. 

The 0-jet WW control region is defined after applying the Pj criterion by changing the mu requirement 
to 80 < mu < 150 GeV. The 1-jet WW control region is defined after the m c T criterion by requiring 
nice > 80 GeV. The purity of the WW control region is expected to be 69% in the 0-jet category and 
43% in the 1-jet category. Thus, the data-derived normalisation of the main non-WW backgrounds, 
the top-quark and Drell-Yan backgrounds, is applied in the VVTV' CR as described in the following two 
subsections. Other small backgrounds are normalised using MC simulation. The CR normalisation is 
applied to the combined WW estimate independently of the production (qq, qq or qq) process. The A(/>//■ 
and mu distributions in the WW control region are shown in Fig. 9 for the 0-jet and 1-jet final states. 

Apart from the sources discussed in the previous section, the extrapolation factor a has uncertainties due 
to the generator choice, estimated by comparing the Powheg+Herwig and aMC@NLO+HERWiG generat¬ 
ors, and due to higher-order electroweak corrections determined by reweighting the MC simulation to the 
NLO electroweak calculation. All uncertainties are summarised in Table 5. 
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Table 5: Theoretical uncertainties (in %) on the extrapolation factor a for WW, top-quark and Z/y* —> tt back¬ 
grounds. "Total" refers to the sum in quadrature of all uncertainties. The negative sign indicates anti-correlation 
with respect to the unsigned uncertainties for categories in the same column. The uncertainties on the top-quark 
background extrapolation factor in the 0-jet category are discussed in Sect. 5.2. 


Category 

Scale 

PDF Gen 

EW 

UE/PS 

pf 

Total 

WW background 

SR 0-jet 

0.9 

3.8 6.9 

-0.8 

-4.1 

- 

8.2 

SR 1-jet 

1.2 

1.9 3.3 

-2.1 

-3.2 

- 

5.3 

Top-quark background 

SR 1-jet 

-0.8 

-1.4 1.9 

- 

2.4 

- 

3.5 

WW CR 1-jet 

0.6 

0.3 -2.4 

- 

2.0 

- 

3.2 

Z/y* —> tt background 

SR 0-jet 

-7.1 

1.3 

- 

-6.5 

19 

21.3 

SR 1-jet 

6.6 

0.66 

- 

-4.2 

- 

7.9 

WW CR 0-jet 

-11.4 

1.7 

- 

-8.3 

16 

21.4 

WW CR 1-jet 

-5.6 

2.2 

- 

-4.8 

- 

7.7 


5.2 Top quarks 

The top-quark background is one of the largest backgrounds in this analysis. Top quarks can be produced 
in pairs (tt) or individually in single-top processes in association with a W boson (Wt) or lighter quark(s) 
(single-/). The top-quark background normalisation from data is derived independently of the production 
process. 

For the 0-jet category, the control region is defined by applying the preselection cuts including the missing 
transverse momentum threshold, with an additional requirement of Ad)f t < 2.8 to reduce the Z/y* —> rr 
background. The top-quark background 0-jet CR is inclusive in the number of jets and has a purity of 
74%. The extrapolation parameter a is determined as described in Eq. (3). The value of a is corrected 
using data in a sample containing at least one ^-tagged jet [9]. 

The resulting normalisation factor is 1.08 + 0.02 (stat.). The total uncertainty on the normalisation factor 
is 8.1%. The total uncertainty includes variations of the renormalisation and factorisation scales, PDF 
choice and parton shower model. Also the uncertainty on the tt and Wt production cross-sections and 
on the interference of these processes is included. An additional theoretical uncertainty is evaluated on 
the efficiency of the additional selection after the jet-veto requirement. Experimental uncertainties on the 
simulation-derived components are evaluated as well. 

In the 1 -jet category, the top-quark background is the second leading background, not only in the signal 
region but also in the WW control region, where the contamination by this background is about 40%. 
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Figure 9: The A tj>d and met distributions in the WW control region, for the 0-jet (top) and 1-jet (bottom) categories. 
The signal is shown assuming an SM Higgs boson with mass mn = 125 GeV. The signal contamination is negligible 
for the SM as well as for the alternative hypotheses. The normalisation factors from the control regions described 
in Sect. 5 are applied. The last bin in each plot includes the overflow. 


Thus two extrapolation parameters are defined: asR for the extrapolation to the signal region and aww 
for the extrapolation to the WW control region. The 1-jet top-quark background control region is defined 
after the preselection and requires the presence of exactly one jet, which must be ^-tagged. Events with 
additional ^-tagged jets with 20 < pj < 25 GeV are vetoed, following the SR requirement. Selection 
criteria on mi r and m TT veto are applied as well. The A </>w and mu distributions in the 1-jet CR are shown 
in Fig. 10. 

The extrapolation uncertainty is estimated using the above mentioned sources of theoretical uncertainties 
and the additional uncertainties specific to the top-quark background: ft and single-top cross-sections and 
the interference between single and pair production of top quarks. A summary of the uncertainties is 
given in Table 5. 


5.3 Drell-Yan 

The Drell-Yan background is dominated by Z/y* —> tt events with r-leptons decaying leptonically. The 
Z/y* —> tt 0-jet control region is defined by applying the preselection requirements, adding mu < 80 GeV 


22 







































0 0.5 1 1.5 2 2.5 3 


A(/> u [rad] 



50 100 150 200 250 

m n [GeV] 


Figure 10: The A(f>u and mu distributions in the top-quark background control region for the 1-jet category. The 
signal is shown assuming an SM Higgs boson with mass m H = 125 GeV. The signal contamination is negligible for 
the SM as well as for the alternative hypotheses. The normalisation factors from the control regions described in 
Sect. 5 are applied. The last bin in each plot includes the overflow. 


and reversing the A<pu criterion, Acb/f > 2.8. The purity of this control region is expected to be 90%. The 
Z/y* —* tt 1-jet control region is defined by applying the preselection requirements, b- veto, m f T > 50 GeV 
as in the signal region but requiring |m TT - n?zl < 25 GeV. The purity of the 1-jet control region is about 
80%. 

The Z/y* —> tt predictions in the 0- and 1-jet categories are estimated using the extrapolation from the 
control region to the signal region and to the WW control region, as there is a 4 -5% contamination of 
Z/y* —> tt events in the WW control region. The Acbu and mu distributions in the Z/y* —> tt control 
region are shown in Fig. 11 for the 0-jet and 1-jet final states. 

A mismodelling of the transverse momentum of the Z boson p^,, reconstructed as Pj, is observed in the 
DY-enriched region. The mismodelling is more pronounced in the 0-jet category. The Alpgen+Herwig 
MC generator does not adequately model the parton shower of the soft jets which balance p 1 /^ in events 
with no selected jets. A correction, based on weights derived from a data-to-MC comparison in the Z 
mass peak, is therefore applied to MC events in bins of p^ in the 0-jet category. The weights are applied 
to Pj at generator-level for all lepton flavour decays. 

Apart from the above mentioned sources of theoretical uncertainties, one additional uncertainty on the Pj- 
reweighting in the 0-jet category is estimated by comparing the difference between the nominal (derived 
in the Z mass peak) and the alternative (derived in the Z mass peak but after the /;! [ T1ISS > 20 GeV criterion) 
set of weights. All uncertainties are summarised in Table 5. 


5.4 Misidentified leptons 

The VV'+jcts background is estimated in the same way as in Ref. [9], where a detailed description of the 
method can be found. The VV'+jcts control sample contains events where one of the two lepton candidates 
satisfies the identification and isolation criteria for the signal sample, and the other lepton fails to meet 
these criteria but satisfies less restrictive criteria (these lepton candidates are called "anti-identified”). 
Events in this sample are otherwise required to satisfy all of the signal selection requirements. The 
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Figure 11: The A tpa and in ft distributions in the Z/y* —> rr control region, for the 0-jet (top) and 1-jet (bottom) 
categories. The signal is shown assuming an SM Higgs boson with mass mn = 125 GeV. The signal contamination 
is negligible for the SM as well as for the alternative hypotheses. The normalisation factors from the control regions 
described in Sect. 5 are applied. 



dominant component of this sample (85% to 90%) is due to VV' +jcts events in which a jet produces an 
object reconstructed as a lepton. This object may be either a non-prompt lepton from the decay of a 
hadron containing a heavy quark, or a particle (or particles) originating from a jet and reconstructed as a 
lepton candidate. 

The W+jets contamination in the signal region is obtained by scaling the number of events in the data 
control sample by an extrapolation factor. This extrapolation factor is measured in a data sample of 
jets produced in association with Z bosons reconstructed in either the ee or pp final state (referred to as 
the Z+jets control sample below). The factor is the ratio of the number of identified lepton candidates 
satisfying all lepton selection criteria to the number of anti-identified leptons measured in bins of anti¬ 
identified lepton p'\- and //. Each number is corrected for the presence of processes other than Z+jets. 

The composition of the associated jets - namely the fractions of jets due to the production of heavy-flavour 
quarks, light-flavour quarks and gluons - in the Z+jets sample and the W+jets sample are different. Monte 
Carlo simulation is used to correct the extrapolation factors and to determine the associated uncertainty. 
Other important uncertainties on the Z+jets extrapolation factor are due to the limited number of jets that 
meet the lepton selection criteria in the Z+jets control sample and the uncertainties on the contributions 
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from other physics processes. 


The total systematic uncertainty on the corrected extrapolation factors varies as a function of the pj of 
the anti-identified lepton; this variation is from 29% to 61% for anti-identified electrons and 25% to 46% 
for anti-identified muons. The systematic uncertainty on the corrected extrapolation factor dominates the 
systematic uncertainty on the IV + jets background. 


6 BDT analysis 

Both the spin and the CP analysis employ a BDT algorithm 4 to distinguish between different signal 
hypotheses. In all cases, two discriminants are trained to separate the signals from each other, or from the 
various background components, using the discriminating variables described in Sect. 4.2. The resulting 
two-dimensional BDT output is then used to construct a binned likelihood, which is fitted to the data 
to test its compatibility with the SM or BSM Higgs hypotheses, using the fit procedure presented in 
Sect. 7. 

Before the training, the same preselection and some of the selection cuts listed in Table 2 are applied to 
data and on all MC predictions for background and signal. The additional selection requirements adopted 
for both the 0- and 1-jet categories arc mu <100 GeV and on pi j? for the spin-2 non-universal coupling 
models. The loosening of the mu requirement with respect to the one applied in the full event selection is 
meant to increase the number of MC events for training. In the 0-jet category a requirement > 20 GeV 
is applied while the A <f>u cut is omitted, whereas the latter is needed in the 1-jet category due to the 
large DY background. All background samples are used in the training and each one is weighted by the 
corresponding production cross-section. 


6.1 Spin analysis 

The spin analysis presented here follows closely the strategy of Ref. [3] for the 0-jet category, while the 
1-jet category has been added and is treated likewise. For each category, one BDT discriminant (called 
BDTo in the following) is trained to discriminate between the SM hypothesis and the background, and a 
second one (BDT2) to discriminate between the alternative spin-2 hypotheses and the background. This 
results in five BDT2 trainings for the alternative spin-2 models defined in Sect. 2.1.2 and one BDTo 
training for the SM Higgs boson. 

The distributions of the input variables used for BDTo and BDT 2 in the 0-jet and 1-jet categories, respect¬ 
ively, are shown in Figs. 2 and 3 (see Sect. 4.2). 

The BDT discriminant distributions (also referred to as BDT output distributions) for the 0-jet and 1-jet 
signal region are shown in Figs. 12 and 13 for the case of universal couplings and of non-universal ones 
with p^- < 125 GeV, respectively. The plots for non-universal couplings and p^- < 300 GeV are very 
similar to the ones obtained using the requirement Pj< 125 GeV except for the BSM signal distribution. 


4 A decision tree is a collection of cuts used to classify events as signal or background. The classification is based on a set of 
discriminating variables (BDT input variables) on which the algorithm is trained. The input events are repeatedly split using 
this information. At each split, the algorithm finds the variable and the optimal selection cut on this variable, that give the 
best separation between signal and background. Finally, an overall output weight (BDT output) is assigned to each event: the 
larger the weight, the more signal-like the event is classified to be. More details can be found in Ref. [10]. 
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The SM Higgs signal is normalised using the SM Higgs-boson production cross-section. Good agreement 
between data and MC simulation is observed in those distributions once the SM signal is included. 
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Figure 12: The distributions of the output of BDTq, discriminating between the SM hypothesis and the background, 
and BDT 2 , discriminating between the alternative spin-2 hypothesis and the background, in the signal region for 
the spin-2 model with universal couplings. The signal is shown for the SM Higgs-boson hypothesis with m H = 
125 GeV. The background yields are corrected with the normalisation factors determined in the control regions. 


6.2 CP analysis 

The CP analysis - which includes both the fixed-hypothesis test and the CP-mixing scan - uses only the 
0-jet category. In this case as well, two BDT discriminants are trained: the first, BDTo, is identical to the 
one described above for the spin analysis (SM Higgs-boson signal versus background, using mu, p r -(, M>u 
and mj as input variables, as shown in Fig. 2). The second BDT, however, called BDT CP in the following, 
is trained to discriminate between the SM signal and signal for the alternative hypothesis without any 
background component. The training obtained using the two pure CP-even or CP-odd hypotheses is then 
applied to all the CP-mixing scenarios. As described in Sect. 4.2, the BDT CP training uses different input 
variables: mu, A <pu, p'j and /;™ ss for the CP-even scenario, as shown in Fig. 4, and mu, A (pu, Euw and 
Apt for the CP-odd scenario, as shown in Fig. 5. 
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Figure 13: BDT () and BDT 2 output distributions in the signal region for spin-2 models with non-universal couplings. 
The signal is shown for the SM Higgs-boson hypothesis with m H = 125 GeV. The pi j! < 125 GeV selection 
requirement is applied to all signal and background processes, corrected with the normalisation factors determined 
in the control regions. 
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The different training strategy adopted for BDT CP and BDT 2 is motivated by the intrinsic difference 
between the spin and CP analyses: while, in the former case, the spin-2 signal is more background¬ 
like (its shape is similar to that of the dominant WW background), in the latter case, the different signal 
hypotheses result in shapes of the input variable distributions which are quite similar to each other, while 
they remain different from the background shape. Therefore, for the CP analysis, the best separation 
power is obtained by training BDT CP to discriminate between the SM and BSM hypotheses. 

The BDT CP output distributions for the SM versus BSM CP-odd and CP-even hypotheses are shown in 
Fig. 14. Good agreement between data and MC simulation is also found in this case, once the SM Fliggs- 
boson signal is included. 



Figure 14: Distributions of the output of BDT CP , discriminating between the SM signal and the signal for the 
alternative hypothesis, in the signal region for the SM versus BSM CP-odd (left) and SM versus BSM CP-even 
(right) hypotheses. The signal is shown for the SM Higgs-boson hypothesis with % = 125 GeV. The background 
yields are corrected with the normalisation factors determined in the control regions. 


7 Fit procedure 


This section discusses the statistical approach adopted in this paper. First, the rebinning of the two- 
dimensional BDT output distribution is discussed. The rebinning is applied for both analyses: the fixed- 
hypothesis tests and the CP-mixing analysis. Afterwards the statistical procedure for the individual ana¬ 
lyses is presented. 

The two-dimensional BDTo x BDT 2 output (or BDTo x BDT CP for the CP analysis) distribution is un¬ 
rolled row by row to a one-dimensional distribution. After the unrolling, bins with less than one back¬ 
ground event arc merged. The latter threshold is applied to the sum of weighted background events, i.e. 
after the normalisation to the corresponding cross-section and luminosity and the application of the post- 
fit scale factors to the background processes. This is done independently in the 0-jet and 1-jet categories 
and for all benchmarks and scans where a retraining of the BDT has occurred. Such a procedure is not 
intended to improve the expected sensitivity per se, rather to stabilise the fit in the presence of a large 
number of free parameters. 
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7.1 Procedure for the fixed-hypothesis test 

The statistical analysis of the data employs a binned likelihood £,{e,p, 9) constructed with one parameter 
of interest, s, which represents the fraction of SM Higgs-boson events with respect to the expected signal 
yields, and can assume only discrete values e = 0 (for the alternative ALT hypothesis) and s = 1 (for the 
SM hypothesis). 

Template histograms representing the nominal signal and background rates are used to construct X(e, p, 9), 
summing over the bins (iVbms) of the unrolled BDT output distributions, per jet category in the spin-2 ana¬ 
lysis case. Ssmj and Salt,; are the signal yields for the SM and alternative hypothesis, respectively, 
while Bj refers to the total background. Systematic uncertainties are represented through the N sys nuis¬ 
ance parameters 9 , constrained by the auxiliary measurements 2Pl(9\9), where 9 is the central value of the 
measurement. The full likelihood can then be written as: 


Alins Ays 

£(s,p,9) = P] P(Ni\n(sS SM ,i(9) + (1 - s)S alt / 9)) + Bi{9)) x \\2R{9 } m . (4) 

i i 

The analysis is designed to rely on shape information to distinguish between different signal hypotheses. 
The overall signal normalisation p is obtained from the tit and, in the case of the spin analysis, as a 
combination over both jet categories. Further details of the various likelihood terms can be found in 
Ref. [9], 

The compatibility of the data and two signal hypotheses is then estimated using a test statistic defined 
as: 


£(e = l,p s= i,9 s=l ) 

q = In-I-*- • (5) 

X(e = O,/) e= o,0£=o) 

For both the numerator and denominator, the likelihood is maximised independently over all nuisance 
parameters to obtain the maximum likelihood estimators p and 9. Pseudo-experiments for the two hypo¬ 
theses (e = 0,1) are used to obtain the corresponding distributions of the test statistic q and subsequently 
to evaluate the p-values, which define the expected and observed sensitivities for various hypotheses. The 
expected /;-values are calculated using the fitted signal strength in data, p™ p for the SM hypothesis, 
and p p for the alternative hypothesis. In addition, for the SM hypothesis the expected p-value fixing 
the signal normalisation to the SM prediction, p™ p v is given. The observed p-values, p™ s and 
are defined as the probability of obtaining a g-value smaller (larger) than the observed value under the 
SM (alternative) signal hypothesis. Pseudo-experiments are needed because the asymptotic approxima¬ 
tion [51] does not hold when the parameter of interest, s in this case, takes only discrete values (0 or 1), 
and in particular -21n(X) does not follow a^ 2 distribution. 

The confidence level (CL) for excluding an alternative BSM hypothesis in favour of the SM is evaluated 
by means of a CL estimator [52]: 


CL S = 




( 6 ) 


which normalises the rejection power of the alternative hypothesis, p ALT , to the compatibility of the data 
with the SM case, 1 - p SM . 
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7.2 Procedure for CP-mixing analysis 

The likelihood definition for the CP-mixing analysis is the same as for the spin analysis, with s = 1 
corresponding to the SM signal hypothesis and e = 0 corresponding to the alternative CP hypothesis. 
Whereas for the fixed-hypothesis test, the sensitivities are estimated by means of pseudo-experiments and 
follow the procedure explained above, for the CP-mixing analysis, the simpler asymptotic approximation 
is used, since the fraction of BSM signal events is now considered a continuous parameter. Results 
using the asymptotic approximation are cross-checked with pseudo-data for a few values of the scan 
parameter. 

The fits to data and to the MC expectation under the SM hypothesis are performed for each value of the 
scan parameter. Two fits to the SM expectation are evaluated: fixing the signal normalisation to the SM 
expectation and to the observed SM signal normalisation. From the fit, the value of the log-likelihood 
(LL) is extracted, as a function of the CP-mixing fraction. The maximum of the LL curve is determined 
and its difference from all other values is computed, -2ALL. The ler and 2cr confidence levels are then 
found at -2ALL = 1 and -2ALL = 3.84, respectively. 


8 Systematic uncertainties 

This section describes the systematic uncertainties considered in this analysis, which are divided into 
two categories: experimental uncertainties and theoretical ones which affect the shape of the BDT output 
distribution. The systematic uncertainties specific to the normalisation of individual backgrounds are 
described in Sect. 5. 


8.1 Experimental uncertainties 

The jet-energy scale and resolution and the fi-tagging efficiency are the dominant sources of experimental 
uncertainty in this category, followed by the lepton resolution, identification and trigger efficiencies and 
the missing transverse momentum measurement. The latter is calculated as the negative vector sum of 
the momentum of objects selected according to the ATLAS identification algorithms, such as leptons, 
photons, and jets, and of the remaining soft objects (referred to as soft terms in the following) that typ¬ 
ically have low values of pj [9]. The various systematic contributions taken into account in the analysis 
are listed in Table 6. More information on the experimental systematic uncertainties can be found in 
Ref. [9], 

In the likelihood fit, the experimental uncertainties are varied in a correlated way across all backgrounds 
and across signal and control regions, so that the uncertainties on the extrapolation factors a described 
in Sect. 5 arc correctly propagated. All sources in Table 6 are analysed to evaluate their impact on both 
the yield normalisation and on the shape of the BDT discriminant distributions. Shape uncertainties are 
ignored if they are smaller than 5% (smaller than the statistical uncertainty) in each bin of the distributions 
under study. Normalisation uncertainties are ignored as well if they are below 0.1%. 
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Table 6: Sources of experimental systematic uncertainty considered in the analysis. The source and magnitude of 
the uncertainties and their impact on the reconstructed objects is indicated. 


Source of uncertainty 

Treatment in the analysis and its magnitude 

Jet energy scale 

1 -7% in total as a function of jet r\ and p\ 

Jet energy resolution 

5 -20% as a function of jet r\ and p\ 

Relative uncertainty on the resolution is 2 -4091; 

^-tagging 

Z?-jet identification: 1 -8% decomposed in pj bins 

Light-quark jet misidentification: 9 -19% as a function of tj and p-\ 
c-quark jet misidentification: 6 -14% as a function of pj 

Leptons 

Reconstruction, identification, isolation, trigger efficiency: below 1% 
except for electron identification: 0.2 -2.7% depending on ;; and p-\ 

Momentum scale and resolution: <1% 

Missing transverse momentum 

Propagated jet-energy and lepton-momentum scale uncertainties 
Resolution (1.5 -3.3 GeV) and scale variation (0.3 -1.4 GeV) 

Pile-up 

The number of pile-up events is varied by 10% 

Luminosity 

2.8% [53] 


8.2 Modelling uncertainties 

The dominant background is SM WW production, and therefore uncertainties on the shape and yield in the 
signal region for this background require special attention. The uncertainties on the WW normalisation 
are discussed in Sect. 5.1; the shape uncertainties are addressed in this section. 

An important uncertainty arises from the modelling of the shape of the WW background in the signal 
region, which is obtained using the same procedure adopted in the evaluation of the theoretical uncertainty 
on the WW extrapolation parameter. The scale uncertainty on the MC prediction of the BDT discriminants 
was studied by varying the factorisation and renormalisation scales up and down by a factor of two. The 
parton shower and generator uncertainties are estimated by comparing the Herwig and Pythia parton 
shower programs and by comparing Powheg+Herwig and aMC@NLO+HERWiG, respectively. Finally, 
the PDF uncertainty is estimated by combining the CT10 PDF error set with the difference between the 
central values of NNPDF2.3 and CT10. The procedure is repeated for each of the final BDT output 
distributions and for each benchmark of the spin and parity analyses. 

Modifications to the shape of the final BDT distribution from PDF and scale variations are found to be 
negligible, and well within the statistical uncertainty of the Monte Carlo predictions. Therefore they are 
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Table 7: From top to bottom, systematic uncertainties (in %) with the largest impact on the spin-2 universal 
couplings, BSM CP-odd and CP-even Higgs-boson fixed-hypothesis tests. This ranking is based on the impact of 
each systematic uncertainty on the CL S estimator (see Sect. 7). For the exact meaning of the different uncertainties 
related to the misidentified lepton rates (the VF+jets background estimate uncertainty), see Sect. 5.4 and Ref. [9], 


Spin-2 

BSM CP-odd 

BSM CP-even 

WW generator: 

2.6 

WW generator: 

0.73 

WW UE/PS: 

21 

p^, re weighting: 

1.2 

WW UE/PS: 

0.66 

Misid. rate (elec, stats): 

9.2 

Misid. rate (elec, stats): 

1.1 

QCD scale Wg*: 

0.45 

Misid. rate (elec, flavour): 

8.4 

Misid. rate (elec, flavour): 

1.0 

pZ reweighting: 

0.43 

Misid. rate (muon flavour): 

7.4 

WW UE/PS: 

0.86 

QCD scale VV: 

0.39 

Misid. rate (muon stats): 

7.3 

Misid. rate (muon stats): 

0.81 

QCD scale Wg: 

0.38 

Misid. rate (elec, other): 

7.3 

Z/y* —> tt generator: 

0.76 

Misid. rate (elec, stats): 

0.37 

WW PDF gg-production: 

6.9 

Misid. rate (muon flavour): 

0.75 

Misid. rate (elec, other): 

0.34 

WW PDF ////-production: 

6.9 

Misid. rate (elec, other): 

0.67 

Misid. rate (elec, flavour): 

0.33 

WW generator: 

3.6 


included in the fit model only as overall normalisation effects. The parton shower and generator uncer¬ 
tainties were found to be statistically significant; therefore, a bin-by-bin shape uncertainty is applied. 

The interference between the gg —* WW and the gg —> H processes is not taken into account in this 
study because of its negligible effect. In fact it results in a 4% decrease in the total yield of events after 
the selection criteria and is of the same order as in Ref. [9]. These results confirm the expectations in 
Ref. [54], 

The signal final-state observables are affected by the underlying Higgs-boson pj distribution. The Higgs- 
boson p-\ distribution for a spin-0 particle is given by the p^-reweighted Powheg+Pythia generator pre¬ 
diction as mentioned in Sect. 3. All spin-0 samples are reweighted to the same distribution to avoid 
any impact of the difference in the Higgs-boson pj predictions between MADGRAPH5_aMC@NLO and 
Powheg on the CP-analysis results. No additional shape uncertainty is considered. For the spin-2 bench¬ 
marks no theoretical uncertainties on the Higgs-boson p ] y are considered, because they are negligible 
compared to the effect of the choice of p } j requirement in the non-universal couplings models. 


8.3 Ranking of systematics 

The impact of each systematic variation on the CL S estimator gives the measure of the relevance of the 
systematic uncertainty on the obtained result. The systematic uncertainties that are found to be most 
important in the various fixed-hypothesis tests are listed for the different cases in Table 7. 

The WW modelling uncertainty dominates in all three benchmarks, and another common large uncertainty 
is due to the IP + jets background estimate. The spin-2 and CP-odd analyses are affected by the Z/y* —> rr 
modelling uncertainty. In addition, the CP-odd analysis is impacted by the modelling uncertainties on the 
non-WW background. The impact of systematics on the CL S estimator is larger for the CP-even case than 
for other benchmarks because of the lower sensitivity of the CP-even analysis. 
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9 Results 


The results of the studies of the spin and parity quantum numbers are presented in this section. The 
SM J p = 0 + hypothesis is tested against several alternative spin/parity hypotheses, and the mixture of 
the SM Higgs and a BSM CP-even or CP-odd Higgs bosons is studied by scanning all possible mixing 
combinations. 

This section is organised as follows. The event yields and the BDT output distributions after the fit to data 
are presented in Sect. 9.1 . The results of the fixed-hypotheses tests for spin-2 benchmarks are discussed 
in Sect. 9.2 and the results for spin-0 and CP-mixed tests are shown in Sect. 9.3. 


9.1 Yields and distributions 

The post-fit yields for all signals and backgrounds are summarised in Table 8 for the spin and CP analyses. 
They account for changes in the normalisation factors and for pulls of the nuisance parameters. All the 
systematic uncertainties discussed in Table 5 and Sect. 8 are included in the fit. The fitted signal yields 
vary significantly in the BSM scenarios because of the differences in the shapes of the input variable 
distributions between the benchmark models. A striking example is given by the benchmark models 
with non-universal couplings: the fitted signal yield varies considerably between the Pj < 125 GeV and 
Pj < 300 GeV selections because of the presence of the tail at high pj values discussed in Sect. 2.1.1. 
The yield fitted under the SM hypothesis, 270 + 70 events (see Table 8), is in good agreement with the 
signal expectation of 238 events, corresponding to the ggF signal strength measured in Ref. [9]. 


9.2 Spin-2 results 

The compatibility of the spin-2 signal model with the observed data is calculated following the prescrip¬ 
tion explained in Sect. 7.1 for five different benchmarks discussed in Sect. 2.1.2. The expected distribu¬ 
tions of the test statistic q, derived from pseudo-experiments, are shown for the universal couplings case 
in Fig. 15 for 0- and 1-jet combined. The q distributions are symmetric and have no overflow or under¬ 
flow bins. The expected and observed significances and CL S are summarised in Table 9. The expected 
significance p™ - using the observed SM normalisation is higher than p™ p x , because the observed 
SM yields in Table 8 are larger than the expected SM yields in Table 3. The SM hypothesis is favoured 
in all tests in data and the alternative model is disfavoured at 84.5% CL for the model with universal 
couplings and excluded at 92.5% to 99.4% CL for the benchmark models with non-universal couplings. 
The exclusion limits for non-universal couplings are stronger for a p ] j cut above 300 GeV because of the 
enhanced sensitivity at high values of the Higgs-boson pj. 

The one-dimensional distribution of the unrolled post-fit BDT output distribution is presented in Fig. 16 
for the K g - 1, K q - 0 and pi j? < 125 GeV scenario in the 0-jet case. The distributions are shown for 
the SM and alternative signal hypotheses separately and compared with the data after the subtraction 
of all backgrounds. Both the signal and background yields are normalised to the post-fit values. The 
distributions are ordered in terms of increasing signal yield and, for visualisation purposes, only contain 
bins that have at least three signal events and a signal-to-background ratio of at least 0.02. 
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Figure 15: Test-statistic distribution for the spin-2 benchmark with universal couplings (k (j = K q ) including all 
systematic uncertainties, with 0- and 1-jet categories combined. The median of the expected distributions for the 
SM (dashed red line) and the spin-2 Higgs-boson signal (dashed blue line) is also shown, together with the observed 
result (solid black line) from the fit to the data. The shaded areas are used to compute the observed //-values. 
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Figure 16: The unrolled one-dimensional BDT output after background subtraction and using post-fit normalisa¬ 
tions, in the case of the spin-2 benchmark with non-universal couplings (k cj = 1, K q = 0), requiring the Higgs-boson 
pi to be below 125 GeV. The background yields are taken from the fit results, assuming the SM signal hypothesis 
in the left-hand plot, and the alternative spin-2 hypothesis in the right-hand plot. 
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Table 8: Post-fit event yields for the 0- and 1-jet categories for various signal hypotheses. The number of events 
observed in data, the signal and the total background yields, including their respective post-fit systematic uncertain¬ 
ties, are shown in the top part of the table, assuming in each case the alternative signal hypothesis. The spin-2 K g = 
K q benchmark is used as an example in the bottom part of the table, to show in more detail the results under the SM 
Higgs-boson hypothesis. For this fit, the individual backgrounds are listed for completeness (see Sect. 5). 


Benchmark 

Signal 

0-jet 1-jet 

Total background 

0-jet 1-jet 

II 

360 ± 100 126 ± 34 

4370 ± 240 1430 ± 60 

K g = 0.5, K q = 1, pj < 125 GeV 
K g = 0.5, K q = 1, pj < 300 GeV 

300 ± 100 103 ± 33 

230 ± 80 82 ± 29 

4430 ± 240 1390 ± 60 

4490 ± 230 1460 ± 70 

Kg = 1, Kg = 0, < 125 GeV 

Kg = 1, Kg = 0, Pj < 300 GeV 

320 ±90 111 ±32 

200 ± 80 71 ±28 

4410 ±240 1390 ±60 

4520 ± 240 1480 ± 70 

BSM CP-odd 

240 ±80 

4490 ± 260 

BSM CP-even 

180 ±60 

4530 ± 240 



Data 

Signal Tot. bkg. 

1F1F Top DY VF+jets Other 

SM 0-jet 
SM 1-jet 

4730 

1569 

270 ± 70 4460 ± 240 

95 ± 26 1450 ± 70 

2904 376 464 370 345 

607 355 233 124 133 


9.3 Spin-0 and CP-mixing results 

Similar to the spin-2 fixed-hypothesis tests, the CP-even BSM Higgs and the CP-odd BSM Higgs-boson 
hypotheses are tested against the SM Higgs-boson hypothesis. The expected distributions of the test 
statistic q, derived from pseudo-experiments for the SM versus BSM CP-odd and CP-even pure states, 
are shown in Fig. 17. The distributions are symmetric and have no overflow or underflow bins. The 
overlap of the test-statistic distributions for the SM hypothesis and the alternative hypothesis indicates 
the sensitivity of the analysis to distinguish them. The expected sensitivity is higher for the CP-odd 
hypothesis than for the CP-even hypothesis. The expected and observed significances and CL S values 
are summarised in Table 9. The expected significances p ™ p ^ and are similar, because the 

observed and the expected SM yields are similar for the spin-0 fixed hypothesis test. The SM hypothesis 
is favoured in both tests and the alternative hypothesis can be excluded at 96.5% CL for the CP-odd Higgs 
boson and disfavoured at 70.8% CL for the CP-even BSM Higgs boson. 

The unrolled BDT output distributions normalised to the post-fit values are shown in Fig. 18. These dis¬ 
tributions show the one-dimensional unrolled BDT output for the SM and alternative signal hypotheses 
separately and compare them with the data after background subtraction. Both the signals and the back- 
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Table 9: Summary of expected and observed sensitivities for various alternative spin/CP benchmarks compared to 
the SM Higgs-boson hypothesis. The expected and observed p-values and the observed 1 - CL S value as defined in 
Sect. 7 are shown for various benchmarks. The results are computed taking into account systematic uncertainties, 
using the combined 0-jet and 1-jet categories for the spin analysis and only the 0-jet category for the CP analysis. 


Channel 

SM 

^ exp, fi- 1 

P SM . 7 

exp, /-i=fi 

-ALT 

exp, fi=p. 

pZ 


i 

n 

r 

t/3 



Spin-2, 

S3" 

II 




0+1-jet 

0.131 

0.039 

0.033 

0.246 

0.117 

84.5% 


Spin-2, K g = 0.5, k ci 

= L Pj 

< 125 GeV 


0+1-jet 

0.105 

0.047 

0.022 

0.685 

0.007 

97.8% 


Spin-2, K g = 0.5, K q 

= L Pj 

< 300 GeV 


0+1-jet 

0.023 

0.014 

0.004 

0.524 

0.003 

99.3% 


Spin-2, 

ii 

* 

= 0, pH < 125 GeV 


0+1-jet 

0.109 

0.041 

0.029 

0.421 

0.044 

92.5% 


Spin-2, 

ii 

* 

= 0, p* < 300 GeV 


0+1-jet 

0.015 

0.016 

0.004 

0.552 

0.003 

99.4% 

BSM CP-odd 

0-jet 

0.078 

0.062 

0.032 

0.652 

0.012 

96.5% 

BSM CP-even 

0-jet 

0.271 

0.310 

0.287 

0.907 

0.027 

70.8% 
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Figure 17: Test-statistic distribution for the pure BSM CP-odd (left) and BSM CP-even (right) benchmarks, includ¬ 
ing all systematic uncertainties. The median of the expected distributions for the SM (dashed red line) and the BSM 
Higgs-boson signal (dashed blue line) is also shown, together with the observed result (solid black line) from the fit 
to the data. The shaded areas are used to compute the observed p-values. 
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Figure 18: The unrolled one-dimensional BDT output after background subtraction in the case of the pure BSM CP- 
odd (top) and BSM CP-even (bottom) benchmarks. The background yields are taken from the fit results, assuming 
the SM signal hypothesis in the left-hand plots, and the alternative hypothesis in the right-hand plots. 
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Figure 19: The BSM CP-odd (top) and BSM CP-even (bottom) mixing scan results. The top row shows the full 
CP-odd scan (left) and the region around the minimum enlarged (right). The 68% and 95% CL exclusion regions 
are indicated as lying above the corresponding horizontal lines. 


ground yields are normalised to the post-fit values. The distributions are ordered by increasing signal, 
and they contain bins that have at least three signal events and are above a signal-to-background threshold 
(S/B) of 0.035. As already mentioned above, these plots are intended for illustrative purposes only. The 
figure shows that the SM Higgs-boson hypothesis is preferred over the pure BSM CP-even or CP-odd 
cases. The S/B ratio used for the CP analysis is higher than the one used for the spin-2 analysis because 
on average the bins with the highest significance have a higher S/B in the CP-mixing than in the spin-2 
BDT output. 

The compatibility of the CP-mixed signal plus background with the observed data is calculated following 
the prescription explained in Sect. 7.2 for the two different scans (mixing of an SM Higgs boson with a 
BSM CP-even or CP-odd boson) as discussed in Sect. 2.2.1. The scan results are presented in Fig. 19. 

In the case of the BSM CP-odd mixing scan (top row of Fig. 19), the expected and observed curves arc 
slightly asymmetric, but the sensitivity to the sign of the scan parameter is small. Due to higher observed 
yields for the SM hypothesis, the expected curve using the observed yields (ju = fi) is above the expected 
curve for the yields fixed to the SM expectation (ju = 1). The minimum of the -2ALL curve is very broad 
and lies at -0.2. The value at 0 corresponds to the SM hypothesis. The values of (/r AWW //r SVI ) • tan a below 
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-6 and above 5 can be excluded at 95% CL, while values below -1.6 and above 1.3 at 68% CL. The fitted 
signal yields and their relative uncertainties, for the SM and alternative signal hypotheses, are very stable 
throughout the scan. They are given in Table 8 for the fixed-hypothesis case. 

The plot on the bottom of Fig. 19 shows the result of the BSM CP-even scan as a function of £ H ww/ksm- 
The separation power between the SM Higgs-boson hypothesis and the BSM CP-even mixed hypothesis 
is enhanced in the region around -1, the observed minimum of the -2ALL distribution, because of the 
interference effect explained in Sect. 4.2. The fitted signal yield, both for the SM and alternative signal 
hypotheses, is stable for values outside the observed minimum region and similar to the values given in 
Table 8 for the fixed-hypothesis case. In the region around the minimum, the fitted BSM signal yield is 
higher, reaching about 370 events. These variations are expected from the significant shape differences of 
the input variable distributions in this region of the parameter scan, as described in Sect. 4.2. The relative 
uncertainty is stable throughout the scan, with values around 30%. 

The observed minimum of the -2ALL curve is at -1.3 and is compatible with the SM hypothesis within 
1.9cr. To further study the compatibility of the SM signal hypothesis with the observed result, sev¬ 
eral scans are performed, by fitting, instead of the real data, pseudo-data generated around the expected 
signal-plus-background post-fit BDT distribution. This means that the nuisance parameters from this test 
are obtained from the fit of the SM signal to the data. Distributions similar to the one observed in the data 
are reproduced by pseudo-data. Furthermore, a fixed-hypothesis test is also performed, where the com¬ 
patibility of the observed data with the SM Fliggs boson versus the CP-even mixed signal corresponding 
to %ww/*"sm = —1-3 is studied, resulting in a 1 - CL S of 43% in favour of the SM and of 93% in favour of 
the alternative hypothesis. 

Values of the mixing parameter, Liww/Vu- above 0.4 and below -2.2 can be excluded at 95% CL, as well 
as in the region between -0.85 and -1. Values above -0.5 and below -1.5, as well as between -1.2 and 
-0.65, can be excluded at 68% CL. 


10 Conclusions 


The Standard Model J p - 0 + hypothesis for the Fliggs boson is compared to alternative spin/parity 
hypotheses using 20.3 fb -1 of the proton-proton collision data collected by the ATLAS experiment at the 
LHC at \fs =8 TeV and corresponding to the full data set of 2012. The Higgs-boson decay WW* —> ev/jv 
is used to test several alternative models, including BSM CP-even and CP-odd Higgs bosons, and a 
graviton-inspired J p = 2 + model with minimal couplings to the Standard Model particles. In addition to 
the tests of pure J p states, two scenarios are considered where all the CP mixtures of the SM Higgs boson 
and a BSM CP-even or CP-odd Higgs boson are tested. 

For the spin-2 benchmarks, the SM hypothesis is favoured in all tests in data and the alternative model is 
disfavoured at 84.5% CL for the model with universal couplings and excluded at 92.5% to 99.4% CL for 
the benchmark models with non-universal couplings. 

The SM Higgs-boson hypothesis is tested against a pure BSM CP-even or CP-odd Higgs-boson hypo¬ 
thesis: the results prefer the SM Higgs-boson hypothesis, excluding the alternative hypothesis at the 
70.8% and 96.5% levels, respectively. 

The data favour the Standard Model quantum numbers in all cases apart from the scan of a CP-mixed 
state with a BSM CP-even Higgs boson, where the data prefer a mixed state with Lww/^sh = -1.3, which 
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is compatible with the SM hypothesis within 1.9 cr. The %ww/a"sm values can be excluded at 95% CL 
above 0.4 and below -2.2, as well in the region between -0.85 and -1. For the mixing with a BSM 
CP-odd Higgs boson, the (a: aww //c sm ) • tan or values above 5 and below -6 can be excluded at 95% CL. 
The preferred value corresponds to (ic AWW //c SM ) • tan a = -0.2, which is compatible with the SM to within 
0.5cr. 
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